AMEN DMENTS TQ THE CLAIMS: 

The following listing of claims replaces all prior versions of the claims. 

1. (Currently Amended) A method for tolerating writing variations in 
input data when processing a data record for finding a counterpart in a reference data 
set, the method comprising the steps of: 

determining in the data record a value of a data field, the data field representing 
an identifier, 

determining, by a processor, from a set of predetermined identifier values at least 

one synonym candidate for the value of the data field using a candidate selection 
criterion , 

determining if a synonym candidate and the value of the data field fulfill a 
predetermined synonym acceptance criterion based on at least one quality parameter, 
wherein said at least one quality parameter takes into account writing variations that are 
evaluated based on differences in the value of the data field and the synonym 
candidate, and when [[if]] the predetermined synonym acceptance criterion is fulfilled, 
associating the value of the data field and the synonym candidate as synonyms and 
automatically updating a synonym set representing known writing variations for the 
identifier in a computer readable database and referencing to respective entries in the 
reference data set by adding the value of the data field to the synonym set without 
intervention of a user before searching for a counterpart, and 

searching for the counterpart for the data record by comparing the value of the 
data field to entries of the reference data set and/or the synonym set after the step of 
determining if the predetermined synonym acceptance criterion is fulfilled, wherein, if 
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the synonym set was updated, said comparison to the synonym set comprises 
comparison to the updated synonym set in the computer readable database. 

2. (Original) A method as defined in claim 1, wherein the at least one synonym 
candidate is determined using a candidate selection criterion depending at least on the 
value of the data field and on a synonym candidate. 

3. (Original) A method as defined in claim 2, wherein, the candidate selection 
criterion takes into account how similar a synonym candidate and the value of the data 
field sound. 

4. (Original) A method as defined in claim 2, wherein the candidate selection 
criterion specifies that at least a predetermined part of the value of the data field is 
identical to a predetermined part of a synonym candidate. 

5. (Currently Amended) A method as defined in any on e of claims claim 2-t©-4, 
wherein the candidate selection criterion takes into account also a further data field of 
the data record, said further data field representing a second identifier. 

6. (Canceled) 

7. (Currently Amended) A method as defined in claim 1 [[6]], wherein at least 
one quality parameter takes into account at least one of the following quantities: 

a number of changes required for converting the value of the data field to be 

identical to a synonym candidate; a proportion of identical characters in the value of the 

data field and in a synonym candidate; and a difference between the length of the value 

of the data field and the length of a synonym candidate. 

3 

Application No. 10/559,386 
Attorney Docket No. 108800-00007 

837788 



8. (Original) A method as defined in claim 7, wherein the number of changes 
required for converting the value of the data field to be identical to a synonym candidate 
is calculated using the Levenshtein distance. 

9. (Original) A method as defined in claim 7, wherein the proportion of identical 
characters takes into account the order of the characters. 

1 0. (Currently Amended) A method as defined in any one of cla ims claim 1 §-te 
0, wherein a first quality parameter is evaluated for each synonym candidate and at 
least a second quality parameter is evaluated at least for the synonym candidate(s) 
having the best first quality parameter. 

1 1 . (Currently Amended) A method as defined in any one of c l aims claim 1 64o 
9, wherein the synonym acceptance criterion requires that there is only one synonym 
candidate having the best at least one quality parameter. 

12. (Currently Amended) A method as defined in claim 1 [[6]], wherein at least 
two quality parameters are evaluated for each synonym candidate and the synonym 
candidate acceptance criterion specifies a threshold for one of the at least two quality 
parameters, the threshold being dependent on a further one of the at least two quality 

parameters. 

13. (Previously Presented) A method as defined in claim 1, wherein the search 

for the counterpart involves comparison of the value of the data field to a synonym set 

relating to the identifier, members of said synonym set referring to respective 

predetermined identifier values, and when the predetermined synonym acceptance 

criterion is fulfilled, the value of the data field is added to the synonym set as a member 
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referring to the synonym associated with the value of the data field before the search for 
the counterpart. 

14. (Previously Presented) A method as defined in claim 1, wherein determining 
the at least one synonym candidate is discarded, if a predetermined discard criterion is 
fulfilled. 

15. (Original) A method as defined in claim 14, wherein the predetermined 
discard criterion specifies that the value of the data field is identical to one of the 

predetermined identifier values. 

16. (Original) A method as defined in claim 14, wherein the search for the 
counterpart involves the synonym set and the predetermined discard criterion specifies 
that the value of the data field is at least one of the following: one of the predetermined 

identifier values, and a member of the synonym set. 

17. (Currently Amended) A method as defined in any one of c l aims claim 14 to 
4§, wherein the predetermined discard criterion takes into account a value of a second 
data field in the data record. 

18. (Previously Presented) A method as defined in claim 1, wherein information 
indicating the at least one synonym associated with the value of the data field is added 
to the data record. 

19. (Original) A method as defined in claim 18, wherein a copy of the data record 
is made for each synonym associated with the value of the data field. 
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20. (Previously Presented) A method as defined in claim 1, wherein the identifier 
relates to a name of one of the following: a geographical entity, a person and an 
organization. 

21. (Currently Amended) A method of updating ^ocessiR§ a synonym set 
stored in a computer readable database to tolerate writing variation in input data when 
the synonym set is used in searching for counterparts in a r efe renc e data set for data 
records, wherein a data record contains containing a data field representing an 
identifier, and members of the synonym set are be i ng first identifier values [[and]] 
referring to respective second identifier values, the second identifier values being 
predetermined identifier values, a nd said searching for a cou nte r pa r t involving 
comp a rison of a-~vakie-~of4he^ t o the synon ym set i n the computer r eadable 
databas e 7 the method of updating the synonym set comprising the steps of: 

determining, by a processor, among the predetermined identifier values at least 
one synonym candidate relating to the value of the data field in the data record using a 
candidate detection criterion, an4 7 

determining if the value of the data field and a synonym candidate fulfill a 
predetermined synonym acceptance criterion based on at least one quality parameter, 
wherein said at least one quality parameter takes into account writing variations that are 
evaluated based on differences in the value of the data field and the synonym 
candidate, and 

when the predetermined synonym acceptance criterion is fulfilled, automatically 

adding befefe-seaf ching for a counterpart for a data record from the syn o nym set the 

value of the data field to the synonym set in the computer readable database as a 
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member referring to the synonym candidate without intervention of a user and before 
the synonym set is used in searching for a counterpart for the data record from the 

synonym set . 

22. (Original) A method as defined in claim 21 , wherein the synonym set is empty 
before adding the value of the data field to the synonym set. 

23. (Original) A method as defined in claim 21 , wherein the synonym set contains 
at least one member before adding the value of the data field to the synonym set. 

24. (Currently Amended) A computer-readable record medium having stored 
thereon computer-executable instructions for causing a computer to perform a method 
for tolerating writing variations in input data when processing a data record for finding a 
counterpart in a reference data set, the method comprising the steps of: 

determining in the data record a value of a data field, the data field representing 
an identifier, 

determining from a set of predetermined identifier values at least one synonym 
candidate for the value of the data field, 

determining if a synonym candidate and the value of the data field fulfill a 
predetermined synonym acceptance criterion based on at least one quality parameter, 
wherein said at least one quality parameter takes into account writing variations that are 
evaluated based on differences in the value of the data field and the synonym 
candidate, and when [[if]] the predetermined synonym acceptance criterion is fulfilled, 
associating the value of the data field and the synonym candidate as synonyms and 
automatically updating a synonym set representing known writing variations for the 
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identifier and referencing to respective entries in the reference data set by adding the 
value of the data field to the synonym set without intervention of a user before searching 
for a counterpart, and 

searching for a counterpart for the data record by comparing the value of the 
data field to entries of the reference data set and/or the synonym set after the step of 
determining if the predetermined synonym acceptance criterion is fulfilled, wherein, if 
the synonym set was updated, said comparison to the synonym set comprises 
comparison to the updated synonym set. 

25. (Currently Amended) A computer-readable record medium having stored 
thereon computer-executable instructions for causing a computer to perform a method 
for updating processing a synonym set to tolerate writing variation in input data when 
the synonym set is used in searching for counterparts in- a reference data s et for data 
records, wherein a data record contains conta ini ng a data field representing an 
identifier, and members of the synonym set are being first identifier values [[and]] 
referring to respective second identifier values, the second identifier values being 
predetermined identifier values, a nd sa i d searc hi ng for a— counterpart involving 
comparison of a value of th e data field t o th e s yno n ym set, the method of updating the 
synonym set comprising the steps of: 

determining among the predetermined identifier values at least one synonym 
candidate relating to the value of the data field in the data record using a candidate 
selection criterion, [[and,]] 

determining if the value of the data field and a synonym candidate fulfill a 

predetermined synonym acceptance criterion based on at least one quality parameter, 
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wherein said at least one quality parameter takes into account writing variations that are 

evaluated based on differences in the value of the data field and the synonym 
candidate, and 

when the predetermined synonym acceptance criterion is fulfilled, automatically 
adding befomsea^Rmg for a~counterpaft for a da ta r e cord from t h e syn onym-set-the 
value of the data field to the synonym set as a member referring to the synonym 
candidate without intervention of a user and before the synonym set is used in 
searching for a counterpart for the data record from the synonym set . 

26. (Currently Amended) A data processing system comprising a processor for 

tolerating writing variations in input data when processing data records for finding 

counterparts in a reference data set, the system comprising: 

means for receiving data records, 

memory means for storing the reference data set, 

means for storing predetermined identifier values for an identifier, 

means for determining in the data records values of a data field, the data field 

representing the identifier, 

means for associating values of the data field and respective predetermined 

identifier values as synonyms, said means configured to determine from the 

predetermined identifier values at least one synonym candidate for a value of the data 

field, to determine if a synonym candidate and the value of the data field fulfill a 

predetermined synonym acceptance criterion based on at least one quality parameter, 

wherein said at least one quality parameter takes into account writing variations that are 

evaluated based on differences in the value of the data field and the synonym 
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candidate, and when [[if]] the predetermined synonym acceptance criterion is fulfilled, to 
associate the value of the data field and the synonym candidate as synonyms and to 
automatically add the synonym candidate to a synonym set representing known writing 
variations for the identifier and referencing to respective entries in the reference data set 
without intervention of a user before searching for a counterpart to provide an updated 
synonym set, and 

means for searching counterparts in the reference data set for the data records 
by comparing values of data fields to entries of the reference data set and/or said 

updated synonyms set. 

27. (Original) A data processing system as defined in claim 26, further comprising 
means for storing a synonym set, members of said synonym set referring to 
respective predetermined identifier values, 

wherein the means for associating values of the data field and respective 
predetermined identifier values as synonyms are configured to add to the synonym set 
a member referring to the synonym associated with the value of the data field before 
activation of the means for searching counterparts. 

28. (Currently Amended) A data processing system comprising a processor 
configured to update a synonym set stored in a computer readable database to tolerate 
writing variation in input data when the processing a synonym set is used in [[for]] 
searching for counterparts i n a r e f e renc e data-set for data records, wherein a data 
record contains compris i n g a data field representing an identifier, and members of the 
synonym set are bemg first identifier values an4 referring to respective second identifier 

10 

Application No. 10/559,386 
Attorney Docket No. 108800-00007 

837788 



values, said second identifier values being predetermined identifier values , and said 
sea rc hi ng invotvmg comparing-a-value of the data field to the synonym-sety the system 
comprising: 

memory means for storing the synonym set, 

means for storing predetermined identifier values for the identifier, 

means for receiving data records, 

means for determining in the data records values of the data field, and 
updating means for automatically adding to the synonym set a value of the data 
field and respective predetermined identifier values associated as synonyms 
without intervention of a user and before the synonym set is used for searching 
for counterparts for the data record from the synonym set, wherein in the 
r e f e renc e d a ta set, said updating means are configured to determine from the 
predetermined identifier values at least one synonym candidate for the [[a]] value 
of the data field, to determine if a synonym candidate and the value of the data 
field fulfill a predetermined synonym acceptance criterion based on at least one 
quality parameter, w h e r e in said at least one quality parameter taking takes into 
account writing variations that are evaluated based on differences in the value of 
the data field and the synonym candidate, and when [[if]] the predetermined 
synonym acceptance criterion is fulfilled, to associate the value of the data field 
and the synonym candidate as synonyms. 

29. (Currently Amended) A data processing apparatus, comprising: 
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at least one processor configured to tolerate writing variations in input data when 
processing data records for finding counterparts in a reference data set, to determine in 
the data records values of a data field, the data field representing an identifier, to 
associate values of the data field and respective predetermined identifier values as 
synonyms, to determine from the predetermined identifier values at least one synonym 
candidate for a value of the data field, to determine if a synonym candidate and the 
value of the data field fulfill a predetermined synonym acceptance criterion based on at 
least one quality parameter, wherein said at least one quality parameter takes into 
account writing variations that are evaluated based on differences in the value of the 
data field and the synonym candidate, and when [[if]] the predetermined synonym 
acceptance criterion is fulfilled, to associate the value of the data field and the synonym 
candidate as synonyms and to automatically add the value of the data field to a [[the]] 
synonym set representing known writing variations for the identifier and referencing to 
respective entries in the reference data set to provide an updated [[a]] synonym set 
without intervention of a user before searching for counterparts, to store the updated 
synonym set, and to search the counterparts in the reference data set for the data 
records by comparing the data records to entries of the reference data set values of 
data fields and/or said updated synonym set. 

30. (Previously Presented) A data processing apparatus as defined in claim 
29, comprising at least one memory configured to store a synonym set, members of 
said synonym set referring to respective predetermined identifier values, and wherein 
the at least one processor is configured to add to the synonym set stored in the at least 
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one memory a member referring to the synonym associated with the value of the data 
field before activation of the search for counterparts. 

31 . (Currently Amended) A data processing apparatus comprising: 
a processor configured to update a synonym set stored in a computer readable 
database to tolerate writing variations in input data when the synonym set is used in 
searching for counterparts for a data record, wherein the processor is configured to 
cause the update by determining de t er m ine from [[a]] predetermined identifier values at 
least one synonym candidate for a value of a data field of data record , to determ i ne 
determining if a synonym candidate and the value of the data field fulfill a predetermined 
synonym acceptance criterion based on at least one quality parameter, wherein said at 
least one quality parameter takes into account writing variations that are evaluated 
based on differences in the value of the data field and the synonym candidate, and 
when [[if]] the predetermined synonym acceptance criterion is fulfilled, associating to 
as s oc i at e the value of the data field and the synonym candidate as synonyms, and 
thereafter [[to]] automatically adding [[add]] to a synonym set representing known writing 
variations for the identifier , referencing to respective entries in the reference data set 
and stored in a memory a value of the data field and respec ti ve p r ed e t e r m in e d i d entifier 
va lu es assoc iate d a s synonyms without intervention of a user to update the synonym 
set before use of the synonym set by input i nto a searching system configured to search 
for counterparts m th e reference dat a-set by comparing the [[a]] value of the data field to 
the updated synonym set. 
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